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We report a new statistical general property in traveling salesman problem, that the nth-nearest- 
neighbor distribution of optimal tours verifies with very high accuracy an exponential decay as a 
function of the order of neighbor n. With defining the energy function as the deviation A from this 
exponential decay, which is different to the tour length d in normal anneahng processes, we propose 
a distinct highly optimized annealing scheme which is performed in A-space and d-space by turns. 
The simulation results of some standard traveling salesman problems in TSPLIB95 are presented. 
It is shown that our annealing recipe is superior to the canonical simulated annealing. 

PACS numbers: 02.60.Pn, 02.70.Uu, 07.05. Tp, 89.20.Ff 

The traveling salesmanproblem (TSP) is stated for the shortest closed tour for a traveling salesman who must visit 
each of N cities in turn The number of candidate set is N\/2N. So it is very difficult to find an efficient 

algorithm for large N because it is bounded by a polynomial function with the problem size. For example, Padberg 
and Rinaldi obtained the optimal solution, i.e., the shortest path, for 532 U.S. cities after 6 hours of calculation with 
the use of the supercomputer Cyber 205 in 1987 3- And, the TSP is a classic famous nondeterministic polynomial 
problem (NP-complete) and a good testing ground for optimization methods. Because the exact solutions are almost 
impossible to obtain, the realistic and valuable aim is to seek for a near-optimal solutions. 

In recent several decades, TSP has attracted great attention of many physicists, and a lot of algorithms with physical 
insight had been proposed. In 1982, Hopfield and Tank used neural networks to find the approximate solution (Sji , and 
since then, simulated annealing (SA) [^Sl^i hierarchical constructioncite 8], real space renormalization j9lllol lll| . 
diffusion process ^tc. had been introduced to challenge TSP. 

Among these algorithms from statistical physics, the most general powerful solution is simulated annealing scheme 
0, • The classical simulated annealing (CSA) was proposed by Kirkpatrick et al. ^ that it extended the Metropolis 
procedure for equilibrium Boltzmann-Gibbs statistics. Moreover, it was shown that the system will end in a global 
minimum if the temperature decreases as the inverse logarithm of time [l^ . This annealing process allows the 
exploration of the configuration space avoiding trapping into the local minima of energy function. However, in fact, 
since the configuration space is bumpiness, CSA has to use higher initial temperature and longer annealing time to 
escape from the energy valleys. So the open question is how to get away from the energy valleys more efficiently. Then 
some complex methods had been proposed. Szu and Hartley proposed the so called fast simulated annealing (FSA) 
that the system can jump around the energy landscape due to a Cauchy-Lorentz visiting distribution instead of the 
Gaussian in CSA The subsequent generalized version presented by Tsallis and Stariolo from Tsallis statistics 

and it was applied to TSP for faster stochastic method of SA . Another effective advanced recipe is quantum 

annealing (QA) j3 where the quantum fiuctuations instead of thermal fluctuations to tunnel through energy 
valleys. 

However, in this paper, different than the traditional solutions to optimize the escape from the local energy minima, 
we focus our attention on how to construct a new flatter conflguration space where the annealing process should be 
easier, faster, and more efficient to reach the final near-optimal or optimal solutions. 

Our recipe of TSP can be simply stated as the following description. Noted that there are a large number of optimal 
solutions of TSPs in TSPLIB95 Through summarizing the statistical property of the nth-nearest- neighbor 

distributions (NNDs) of optimal tours of TSPs in TSPLIB95, it is found that the distributions decay exponentially 
with very high accuracy as a function of the order of neighbor n. And, we defined a new energy function and 
constructed a new configuration space under the name of A-space, where A describes the deviation between the 
tour's neighbor distribution and the general decayed exponentially statistical property. A new simulated annealing 
is accomplished in A-space and, further, is executed in A-space and d-space by turns. This solution is considerably 
faster than the traditional SA in solving the TSP. It is indeed so, as shall be shown in the following text. 

One considers a set of A'^ cities with intercity distance dij between the «th city and j'th city. In this work, the 
problems are limited to the symmetric TSPs, dij — dji. Especially, the dij = oo if there is no link between the «th 
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FIG. 1: (Color online) (a) the nth-NND r{n) of optimal tours from att48 problem to rl5934 problem in TSPLIB95 vs the 
neighbors' order n. (b) The average {r{n)) (circles) and the fitting result P{n) (solid line) as functions of n. A perfect 
exponential decays of P(n) vs n is shown with P(n) = 0.74475 x e'"/'^-'"^^^^ + 0.000771. 



node and jth node. The order of neighbor n is defined by the nearest neighbor sequences. For example, the nearest 
neighbor is corresponding to n = 1. n = 2 means the edge dij is the second shortest edge among all edges connecting 
with city j, or city i is the second nearest neighbor of city j. The rest, n = 3,4, . . . , N — 1, may be deduced by 
analogy. For every possible tour of TSP, the nth-NND r(n) should be defined by 

rW = ^, n = l,2,3,...,N-l, (1) 

where s{n) is total number of the nth-nearest-neighbor for all links in this tour. Clearly, the nth-NND r(n) e [0, 1] 
and 1^(1^) = 1- Idealistically, the best economical tour routine is that every link between two cities is shortest, 

or it only exists the nearest neighbor, r(l) = 1. Apparently, it is impossible found this extreme condition in realistic 
TSPs. 

In Fig. n^a), we presented the nth-NNDs r{n) of optimal tours of many problems in TSPLIB95. It appears that 
all r(n) can be qualitatively regarded as an exponential decay as a function of the order of neighbor n. And, it is 
should be pointed out that this qualitative exponential decay was confirmed by almost all TSPs in TSPLIB95. Then, 
the average nth-NND (r(n)) is given in Fig. ^b). As a result of nonlinear fitting, the general empirical rule of the 
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FIG. 2; (Color online) The neighbor distributions r(n) of some approximate tours for att48 problem which have 10% ~ 20% 
longer than the optimal tour length. The solid line is the general empirical neighbor distribution P(n) of optimal tours from 
Eq. (HI. 

?ith-NND is represented by 

P{n) = Ax e-''/^ + C, n = 1, 2, 3, . . . , iV - 1. (2) 

Here A = 0.74475 ± 0.01376, B = 1.76985 ± 0.03785, and C = 0.000771 ± 0.00129. Noted that P{n) is introduced to 
denote the general empirical property, but {r(n)) is the statistical result. From the above considering, the nth-NND 
of optimal tour for any TSPs should be consistent with Eq. On the contrary, it is only stated that a tour with 
similar distribution of Eq. ^ may be not optimal, but it must be an approximate tour. 

An example of the comparison between r(n) of the near-optimal tours and P{n) for the att48 problem is given in 
Fig. 121 It is clear that there exists distinct deviation of the near-optimal r{n) from the corresponding statistical P{n). 
The difference of the rtth-NND between any tours and the general empirical optimal tour Eq. is defined as the 
following equation, 

n 

A = ^ (s„ - Sn) X n, (3) 

i=l 

where s„ is the sum of number of the nth neighbor in this tour, and s„ = P{n) x 2N is the empirical sum for the 
corresponding optimal tour from Eq. ^ and Eq. Another important parameter is the tour length d which is 
used as the energy function in most of simulated annealing algorithms, such as CSA, QA, etc.. Our annealing scheme 
is attempt to execute the Monte Carlo (MC) process to make the nth-NND r(n) of any initial tour getting close to 
the empirical distribution of optimal tour P{n). We will show the detail procedures in the following text. 

The first crucial problem is how to generate a tour which should have smaller A. The possible candidate solutions 
as we know are, tour construction algorithms, tour improving algorithms, and composite algorithms. The tour 
construction algorithms gradually build a tour by joining a new city at each step. The tour improvement algorithms 
improve upon a tour by performing various exchanges. And, the composite algorithms is a combination of the 
above two algorithms. Among these methods, tour improving algorithm is powerful and easy to implement. A simple 
example is the so-called 2-opt algorithm. For a given tour, the new tour with shorter length is constructed by replacing 
two links by another two links, and the following procedures go on this cooperation until no more improvement are 
possible. In simulated annealing, usually, the tour improving algorithm is embedded in Metropolis MC process to 
generate new tours at environment temperature. 

In our algorithm, the similar mechanism is used to optimize a given tour by metropolise MC process. We first 
generate a random tour and set up a temperature which will be reduced very slowly. For every step, a new tour is 
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FIG. 3: (Color online) The detailed annealing processes in d-space annealing (SA) and A-space (OA) for eillOl problem in 
TSPLIB95. The insets are the annealing details near optimal tour. 



reconstructed from the old one by 3-exchange algorithm |23|. If AA < 0, where AA = X^ew ^ X^id, or e^^/* greater 
than a random number between and 1, the new tour is accepted. Repeated applications of this MC step until A 
smaller than Ac, where Ac means the acceptable difference between the neighbor distribution of the final tour and the 
empirical optimal P{n) from Eq. Since it is impossible to offer the exact value of Ac for each TSP, we usually 
keep running the annealing step until the tour length d cannot get smaller. 

Note that both our recipe and classical simulated annealing are belonged to the annealing scheme. There are 
the same MC steps, but our energy function is A different from the tour length d in classical simulated annealing. 
Moreover, the corresponding configuration space is absolutely different. We label our annealing scheme as A-space 
annealing, and rename the canonical simulated annealing scheme as d-space annealing. 

To measure the performances of the two distinct annealing strategies, we chose a standard TSP sample, eillOl 
problem, from the TSPLIB95 To^. It is a iV = 101 cites problem constructed by Christofids and Eilon whose optimal 
tour length dopt = 629. Both A-space annealing and d-space annealing were applied to seek for the near optimal 
tours. Again, noted that both annealing schemes begin with a random tour and make use of the same 3-exchange 
algorithm. In Fig. O we present the detail annealing processes in d-space and A-space. The evolutions of tour length 
are described by Ad ~ x 100%. As it turns out, the final near optimal tour for A-space annealing approximately 

coincides with d-space annealing (see Fig. andl^b). But the A-space annealing is obviously faster than d-space 
annealing and noted that the initial temperature in A-space annealing is far lower than in d-space annealing. In other 
words, A-space annealing work at small initial temperature as well as at high initial temperature to make A of the 
tour smaller than the criterion Ac. In fact, even only 3-exchange technique can obtain a nice result, but it was highly 
dependent on the initial tours. 

It is well known that the initial configuration with small temperature often cause trapping into local minimum too 
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TABLE I: Annealing results in d-space, A-space, and (A + d)-space for some problems in TSPLIB95. 
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22.2% 
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3.5 X 10" 


29.0% 
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14.7% 


1901 


0.2 X 10® 


att532 


43.0% 


176575 
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46.0% 


15376 


1.0 X 10" 


28.0% 


20316 


0.2 X 10" 



early. From Fig. if someone uses d-space annealing to find a near optimal tour with same Ad resulted from A-space 
annealing, it should be paid out the higher initial temperature and the longer annealing time. So we conjecture that 
the configuration space in A-space annealing is smoother than that in d-space annealing, or the fluctuation in d-space 
annealing is stronger than that in A-space annealing. As a result, it is not necessary that one use the higher initial 
temperature to jump through energy valleys in A-space annealing. 

However, it is very important to mention that c?-space annealing can increase accuracy at the cost of higher initial 
temperature and longer annealing time, but A-space annealing lose this property because it almost produces the similar 
effect on annealing processes at higher or lower initial temperature for the more smooth A-space. It was concluded 
that A-space annealing can present approximate tours in shorter time, but very difficuh to improve its accuracy. 

In fact, the final tours obtained by A-space annealing is really different than the tours obtained by d-space annealing 
though both Ad are looked like the very similar value about 10% (see Fig. 12^^). Taking the eillOl problem as example, 
the A of the approximate tour in d-space annealing is 730 and the A in A-space annealing is 260 (several other examples 
presented in Table P). The results in A-space annealing have properties of small d and small A. It means that A-space 
annealing can making d of the tours close to the optimal tour as well as making the neighbor distributions r{n) 
of tours close to P{n). But in d-space annealing, the final approximate tours are restricted by the certain initial 
temperatures and their neighbor distributions are far from the distribution of optimal tour P{n). Consequently, at 
a certain temperature, the annealing processes in d-space is easy to be trapped into local minimum and it loses the 
potential to find better tours in normal annealing schemes. 

From the above statements, in d-space annealing, the tours usually have small d and large A at local minimum. 
Similarly, in A-space annealing, the tours normally have small d and small A, but they may be not in the local 
minimum of d-space at certain temperatures. It suggests that one can design a combinational optimized simulated 
annealing in both d-space and A-space. As an example, it should be has higher potential to capture better tours when 
we take the tour obtained from A-space annealing as the initial tour in d-space annealing at certain temperature. 
Theoretically, next, the annealing process can be moved onto A-space. Repeated this annealing process back and 
forth until the global optimal tour was found. Simulated annealing in A-space and d-space by turns is the point of 
our so-called optimized annealing scheme in this work. In fact, since the neighbor distribution of optimal tours P{n) 
is just a statistical rule, we perform a simple case of optimized annealing under the name of (A + d)-space annealing, 
which is constructed by A-space annealing and d-space annealing only one time in this paper. 

To get a comprehensive picture of the performance of our annealing schemes, we have calculated several problems 
in TSPLIB95, for N = 48,76,101,124,136,225,532. The annealing results in d-space, A-space, (A -t- d)-space are 
listed at Table |l| Besides the confirmation of the above statements about Ad, A, and annealing velocity for d-space 
annealing and A-space annealing, it is highly important that (A + d)-space annealing is impressive superior to other 
annealing processes at each performance index. 

In general, it is shown that our annealing recipe, especially (A -I- d)-space annealing, is far better than CSA. But it 
should be stated that the results are worse and worse with the growth of the city number TV. It is believed that the 
simple 3-exchange technique is not power enough to act on TSPs with large N. We emphasized the fact that, in this 
paper, we are not to pursue a perfect algorithm for the optimal tour, but to present a distinct and excellent annealing 
technique. 

In conclusion, for the symmetric traveling salesman problems, according to the statistical general property of the 
nth-nearest-neighbor distributions of the optimal tours, wc define a new configuration space, so-called A-space, where 
A describes the difference between the neighbor distributions of any tours with the general rule Eq. (0) . As indicated 
in simulation results, A-space annealing is more efficient than normal simulated annealing and the combination of 
A-space annealing and CSA is more powerful. Further studies would be needed to clarify more detail properties of the 
nth-nearest-neighbor distribution and to seek for more valuable recipes for any TSPs even in the case of large N. We 
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expect that this new general property Eq. (0) can be combined with other techniques for TSP or other optimization 
problems. 

Note that the similar distribution in the random link TSP and the Euclidean TSP is reported in Ref. |23 after we 
presented this work to axriv. 

We thanks Olivier Martin for pointing out reference |22j| . The work reported in this paper was supported by the 
National Natural Science Foundation of China under Grant No. 10305005 and the Special Fund for Doctor Programs 
in Lanzhou University. 
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